DNA encoding a protein comprising calmodulin-and actin-binding human caldesmon peptide fragment

ABSTRACT

Provided are functional peptide fragments of human caldesmon having calmodulin and actin-binding activity, the amino acid sequences and nucleic acid sequences therefor.

This is a divisional application of Ser. No. 08/285,440, filed Aug. 4, 1994, now issued as U.S. Pat No. 5,532,337 which is a continuation of now abandoned application Ser. No. 07/858,947, filed Mar. 27, 1992.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to a polypeptide that has the amino acid sequence of the functional domain of human caldesmon.

2. Description of Related Art

Caldesmon is a protein that can bind with calmodulin, actin and tropomyosin, found in all tissues except skeletal muscle and cardiac muscle. It contributes to the regulation of actomyosin system in smooth muscle. This regulatory function depends on the concentration of calcium ions (flip-flop regulation) in bringing about the regulation of the actomyosin system (Proc. Natl. Acad. Sci. USA, 78, 5652-5655, 1981). When the concentration of calcium ions is low, caldesmon binds with the actin filament-tropomyosin system, and inhibits the mutual interactions of actin and myosin. When the calcium ion concentration is increased, complexes of active calmodulin and caldesmon are formed. This complex is released from the actin filament-tropomyosin system. Because of this, the inhibition of actin-myosin interaction caused by caldesmon cease, and these interactions begin.

Caldesmon was first isolated from the smooth muscles of chicken gizzards, and it has been isolated from other vertebrates. The results of the limited digestion of chicken caldesmons by the protease α-chymotrypsin have shown that a polypeptide of the size of about 35 kDa at the carboxy-terminal end of chicken caldesmons is the functional unit that brings about flip-flop regulation that depends on the concentration of calcium ions (J. Biochem., 102, 1065-1073, 1987).

There are two isoforms of chicken caldesmon that have different molecular weights. The results of electrophoresis on sodium dodecyl sulfate-polyacrylamide gels have shown that one is of high molecular weight (120,000 to 150,000 Da), now named h-caldesmon, and that the other is of low molecular weight (70,000 to 80,000 Da), now named l-caldesmon. h-caldesmon is abundant in smooth-muscle tissue, and l-caldesmon is abundant in non-muscle tissue and cultured cells. The amino acid sequence of h-caldesmon and the DNA sequence that codes for it have recently been identified (Biochem. Biophys. Res. Commun called BBRC below!, 164, 503-511, 1989), as have been the amino acid sequence of l-caldesmon and the DNA sequence that codes for it (j. Biol. Chem., 266, 355-361, 1991; also Japanese Patent Application 2-37362).

Both h- and l-caldesmons are of chicken origin, and the amino acid sequence of human caldesmon is still unknown.

The morphological change of cells is one characteristic of carcinogenesis. The preservation of cell morphology is closely related to the actin network of the cells, and when cells are transformed, the change in cell morphology occurs in correlation with loss of actin cables, which are part of the filament system that makes up the cytoskeleton (Proc. Natl. Acad. Sci. USA, 72, 994-998, 1975). Immunohistochemical analysis have revealed that caldesmon is localized in actin cables, cell attachment sites, and membrane ruffles in normal cells, but in cancer cells, the compound is not found in any defined cell location (Proc. Natl. Acad. Sci. 81, 3133-3137, 1984).

When cultured cells are transformed with a variety of oncogenic viruses, the caldesmon concentration of the cells decreases and the phosphorylation of the caldesmon increases (Proc. Natl. Acad. Sci. USA 81, 3133-3137, 1984, and Saibo Kogaku Cell Technology!, suppl. 3, "Molecules regulating calcium signalling," 140-153, 1987).

Problems to be solved by this invention

As described above, there seems to be a close relationship between caldesmon content and the change in morphology of cells that have been transformed. If the relationship were understood, it might be possible to use human caldesmon for treatment of cancer. In addition, if the DNA sequence that codes for human caldesmon were known, it would be possible to use it for the diagnosis of cancer. Further, if it were possible to isolate a polypeptide that has activities of the actin-binding functional unit, the tropomyosin-binding functional unit, and the calmodulin-binding unit, this polypeptide could be used to inhibit or accelerate the mutual interactions between actin and myosin whatever the calcium concentration is, so that the polypeptide could be used as a vasodilator or a regulator of the movements of the digestive tract.

The purpose of this invention is to provide a polypeptide that has the activity of human caldesmon and the DNA sequence that codes for said polypeptide.

SUMMARY OF THE INVENTION

To summarize this invention, the first part of this invention relates to a polypeptide that has a calmodulin-binding activity and an actin-binding activity characterized by said polypeptide has an amino acid sequence of SEQ ID NO:1 in the Sequence Listing in a polypeptide molecule. The second part of this invention relates to a gene that codes for the polypeptide of the first part of this invention.

The polypeptide of this invention is a polypeptide that has both calmodulin-binding activity and actin-binding activity; it is, for example, a polypeptide that has within it the amino acid sequence shown as SEQ ID NO:1 in the Sequence Listing, which sequence is that of the functional unit of human caldesmon, which unit was discovered by the inventors of this invention. Polypeptides that have within them this amino acid sequence include, for example, the polypeptides that have the sequence of SEQ ID NO:2 to 5 of the Sequence Listing.

A gene that codes for the polypeptide of artificial human caldesmon is, for example, a gene with the DNA sequence of SEQ ID NO:7 to 12 in the Sequence Listing, and genes that can hybridize under stringent conditions to the gene with the SEQ ID NO:7 to 12, and which genes code for a polypeptide that has calmodulin-binding activity and actin-binding activity.

The inventors of this invention prepared a cDNA library from HeLa cells, a human cell line, and next selected cDNA clones that code for a polypeptide of human caldesmon from this cDNA library; the DNA sequence and the amino acid sequence coded for human caldesmon were deduced by DNA sequence analysis. Next, based on various finding, the inventors prepared polypeptides of various lengths, and looked for the functional domain of human caldesmon polypeptide, using calmodulin-binding activity and actin-binding activity as indices. Based on the findings described below, the inventors achieved this invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a restriction map of the cDNA that codes for type I (SEQ ID NO:6) and type II (SEQ ID NO:5) caldesmon (CaD).

FIG. 2 is a figure that shows the relationships of type I and type II caldesmon, together with the CaD fragments of the present invention comprising SEQ ID NO:1. These CaD fragments are fragments 312AA (SEQ ID NO:3), 118AA (SEQ ID NO:2), 116AA (SEQ ID NO:1), 94AA (SEQ ID NO:22), 90AA (SEQ ID NO:25), and 88AA (SEQ ID NO:27).

DETAILED DESCRIPTION OF THE INVENTION

Below, this invention is explained in detail.

The gene that codes for human caldesmon can be isolated and analyzed as described here. To obtain the gene of human caldesmon, HeLa cells or the like can be used, and the total RNA, which includes the poly(A)⁺ RNA, of the cells, was isolated, and bound onto a cellulose carrier or the like with oligo(dT). This was used as a template in the synthesis of cDNA with reverse transcriptase. The cDNA was synthesized by the method of Okayama-Berg or by the method of Gubler-Hofmann. The cDNA synthesized in this was ligated with a plasmid or phage vector and introduced into a host, giving a cDNA library.

The cDNA library was screened for the desired clones by one of the two following methods. The amino acid sequence of a portion of the desired protein was identified, and an oligonucleotide that coded for that amino acid sequence was synthesized and used as a probe in screening. Or, an antibody that bound with the desired protein was prepared and used to screen the cDNA library constructed in an expression vector, in an immunological method.

When the immunological method of screening is used, λgt11 is well used as the expression vector. λgt11 is commercially available; it can be purchased from, for example, Stratagene. When λgt11 is used in immunological screening, first, the library is amplified on agar plates, and nitrocellulose or nylon filters treated with isopropyl-β-D-galactopyranoside (IPTG) are placed on these plates and expression is induced, after which the protein being expressed adsorbes onto the filter. The filter is immersed in buffer that contains bovine serum albumin (below, referred to as the blocking buffer), and after this step of blocking, antibodies (first antibody) that bind with the desired protein are added to fresh blocking buffer, and the filter is immersed in this blocking buffer, so that the desired protein and the antibodies form complexes. After the filter is washed, it is immersed in a blocking buffer that contains the second antibody, labelled with an enzyme. The first antibody and the second antibody form complexes. Second antibodies coupled to alkaline phosphatase or peroxidase are commercially available. The filter is washed to remove excess second antibody that has not adsorbed to the filter, and the filter is put in a developing solution, and colored clones are selected. The recombinant phages obtained in this way are used to infect host Escherichia coli cells, and phage DNA is obtained from the phage lysate. The cDNA from the recombinant phage DNA is isolated and purified, and its DNA sequence is identified.

The antibodies used for immunological screening can be antisera prepared in rabbits immunized with other kinds of caldesmon, such as chicken caldesmon that have amino acid sequences likely to correspond in part to the sequence of the desired protein, human caldesmon.

The cDNA sequence coding for human caldesmon, which has been identified by the inventors, is SEQ ID NO:13 and 14 in the Sequence Listing. Of the two forms of human caldesmon, the 558 amino acid residues shown as SEQ ID NO:6 in the Sequence Listing is that of the caldesmon of high molecular weight, and the DNA sequence that codes for this is SEQ ID NO:12 of the Sequence Listing. The caldesmon of low molecular weight is exactly the same as that of high molecular weight except that it lacks amino acids 202 to 227, and it is shown as the 532 amino acids of SEQ ID NO:5 in the Sequence Listing, and the DNA sequence that codes for this is SEQ ID NO:11 in the Sequence Listing. Here, the form of higher molecular weight is called type I, and the form of lower molecular weight is called type II.

To obtain plasmids that express the polypeptides of human caldesmons type I and type II, DNA that codes for the caldesmon of type I and DNA that codes for the caldesmon of type II are prepared separately from the recombinant phage DNA mentioned above, and the DNA is inserted into an expression plasmid, such as the plasmid pTV118N. The plasmid that coded for the caldesmon of type I was designated pTV118NHS3CaD1, and the plasmid that coded for the caldesmon of type II was designated pTV118NHS3CaD2.

With pTV118NHS3CaD1, it is possible to prepare DNA that codes for polypeptides of various lengths. For example, a site slightly upstream of the initiation codon of the region that codes for type I caldesmon of this plasmid can be cleaved with an appropriate restriction enzyme, and exonuclease can be used to remove the sequence of the 5'-side in the caldesmon gene. By changes in the reaction conditions, it is possible to prepare genes that code for polypeptides of various chain lengths that are missing the N-terminal portion of type I caldesmon. Alternatively, it is possible to amplify DNA of various lengths by the polymerase chain reaction (PCR: Saiki et al., Science, 230, 1350-1354, 1985) with a number of primers synthesized and pTV118NHS3CaD1 as a template, and said DNA can be used to prepare polypeptides of various lengths. As described in detail, with the SEQ ID NO:15 and 16 in the Sequence Listing as primers, PCR can be done (at 94° C. for 30 sec in step 1, at 55° C. for 2 min in step 2, and at 72° C. for 1 min in step 3, for a total of 20 cycles), which results in the amplification of DNA that contains the DNA sequence shown as SEQ ID NO:10 in the Sequence Listing, which codes for the polypeptide shown in the Sequence Listing as having SEQ ID NO:4 (called 312AA below). With the sequences of SEQ ID NO:17 and 18 in the Sequence Listing as primers, it is possible to amplify DNA that contains the DNA sequence shown as SEQ ID NO:9 in the Sequence Listing, which codes for the polypeptide shown in the Sequence Listing as having SEQ ID NO:3 (called 122AA below). With the sequences of SEQ ID NO:18 and 19 in the Sequence Listing as primers, it is possible to amplify DNA that contains the DNA sequence shown as SEQ ID NO:8 in the Sequence Listing, which codes for the polypeptide shown in the Sequence Listing as having SEQ ID NO:2 (called 118AA below). With the SEQ ID NO:20 and 21 in the Sequence Listing as primer, it is possible to amplify DNA that contains the DNA sequence shown as SEQ ID NO:23 in the Sequence Listing, which codes for the polypeptide shown in the Sequence Listing as having SEQ ID NO:22 (called 94AA below). In addition, with the sequences of SEQ ID NO:21 and 24 in the Sequence Listing as primers, it is possible to amplify DNA that contains the DNA sequence shown as SEQ ID NO:26 in the Sequence Listing, which codes for the polypeptide shown in the Sequence Listing as SEQ ID NO:25 (called 90AA below). Next, these DNAs are inserted into expression plasmids, such as, for example, pTV118N. The plasmid that coded for 312AA was named pTHCD247, the plasmid that coded for 122AA was named pTHCDXa443, the plasmid that coded for 118AA was named pTHCD443, the plasmid that coded for 94AA was named pTHCDXa451, and the plasmid that coded for 90AA was named pTHCD451.

The plasmids were used to transform Escherichia coli cells, such as E. coli JM109, for their expression, and the recombinant cells were cultured under appropriate conditions so that the desired polypeptides accumulated within the E. coli cells. The purification of the polypeptides was, for example, as follows. The recombinant E. coli cells were cultured in a culture medium such as L broth, and the cells were harvested and disrupted by being sonicated. The disrupted cells were centrifuged and the supernatant was obtained. After treatment such as dialysis, the dialysate was put on a column for gel filtration and then on a column for ion-exchange chromatography for purification. The desired proteins were purified by use of affinity columns coupled with calmodulin or tropomyosin, making use of the properties of the target protein. 122AA and 94AA have a sequence that can be recognized by factor Xa. By cleaving with factor Xa site-specifically, it is possible to prepare a polypeptide shown as SEQ ID NO:1 in the Sequence Listing (called 116AA below) and a polypeptide shown as SEQ ID NO:27 in the Sequence Listing (called 88AA below).

The polypeptides to which this invention is related are shown in FIG. 2. Type II is a polypeptide that is missing one portion of type I. 312AA is a polypeptide that is the C-terminal portion of types I and II, 122AA and 118A are polypeptides that have a sequence of 116 amino acids in the C-terminal sequence of 312AA, and 94AA and 90AA are polypeptides that have a sequence of 88 amino acids that are in the central portion of 118AA, lacking both the N-terminal portion and the C-terminal portion. On the N-terminal of 118AA, there are residues Met-Ala that originated from a PCR primer, and on the N-terminal of 90AA, there are residues Met-Ala that also originated from a PCR primer. On the N-terminal of 122AA and 94AA, there are residues Met-Ala that originated from PCR primer and a sequence that can be recognized by factor Xa.

it is possible to measure the calmodulin-binding activity of these polypeptides by use of their affinity for calmodulin during calmodulin-affinity chromatography on a column, and it is possible to measure the actin-binding activity of these polypeptides by the method of coprecipitation with actin. As shown in Table 1, the polypeptide (type I, type II, 312AA, 122AA, 118AA, and 116AA) that contains the polypeptide of 116 amino acids (called 116AA below) that has SEQ ID NO:1 in the Sequence Listing has both calmodulin-binding activity and actin-binding activity.

                  TABLE 1                                                          ______________________________________                                                      calmodulin-binding                                                                          actin-binding                                        polypeptide  activity     activity                                             ______________________________________                                         type I       +            +                                                    type II      +            +                                                    312AA        +            +                                                    122AA        +            +                                                    118AA        +            +                                                    116AA        +            +                                                    94AA         +            -                                                    90AA         +            -                                                    88AA         +            -                                                    ______________________________________                                          (Activity is shown as +, and lack of activity as -.)                     

Type I, type II, 312AA, 122AA, 118AA, 116AA all have tropomyosin-binding activity and have a inhibitory activity to actomyosin ATPase, and so the functional unit of human caldesmon is identified as being 116AA.

It is possible to construct a DNA that codes for 116AA out of pTHCD443 as needed by the method of site-specific mutagenesis, and the DNA can be used to express 116AA.

As described above in detail, this invention provides various polypeptides that have activities of human caldesmon and also provides genes that code for these polypeptides. It is possible to produce both on a large scale by the use of genetic engineering, and said genes and polypeptides will be useful in diagnosis and other fields of medicine, and in biochemical research, as well. The functional unit of human caldesmon, 116AA, can be used as a material for chimera proteins and peptide transfer, and the DNA with SEQ ID NO:7 in the sequence listing will be of use in the fields of genetic engineering and protein engineering.

EXAMPLES

Below, this invention will be explained in further detail with reference to examples, but this invention is not to be taken to be limited to these examples.

Example 1

Cloning of cDNA that codes for the polypeptide of human caldesmon from HeLa cells

1-1. Construction of a cDNA library

The total RNA of HeLa cells was obtained by the method of guanidium-cesium chloride (Biochemistry, 18, 5294-5299, 1979), and poly(A)⁺ RNA was isolated by column chromatography with oligo-(dT)cellulose. The purified poly(A)⁺ RNA was used as a template and oligo-(dT) was used as the primer in the synthesis of cDNA by the method of Gubler-Hoffmann (Gene, 25, 263-269, 1983). The ends of the cDNA were blunted with T4 DNA polymerase, and the EcoRI site in the cDNA was methylated with EcoRI methylase. Next, cDNA was ligated to an EcoRI linker d(pGGAATTCC)! with use of T4 DNA ligase, and cDNA with both ends having an EcoRI was constructed by EcoRI digestion. This cDNA was ligated with the EcoRI arm of λgt11 (Stratagene), and a cDNA library was made by in vitro packaging with GigapackII Gold (Stratagene).

1-2. Screening of the cDNA library

Plaques of phages with use of E. coli Y1090 as the host cells were formed. To do this Y1090 cells were cultured overnight at 37° C., in L medium that contained 0.02% maltose, and the cells were collected by centrifugation. The cells harvested were suspended in 10 mM MgSO₄ and were added into a suspension of phage solution and kept at 37° C. for 15 minutes to allow attachment of the phages to the host cells. To this mixture was added a mixture of soft agar (L medium containing 10 mM MgSO₄ with agarose added to the final concentration of 0.6%, which medium was autoclaved and then cooled to and kept at 50° C,), and this new mixture was spread on L plates (below, this step is referred to as "plating"). The plates were kept at 42° C. for 3 hours, and after formation of plaques, a nylon membrane (HYBOND-N; Amersham) treated with 100 mM IPTG was placed on the plate and the whole was kept at 37° C. for 3 hours. The nylon membrane was removed from the plate and immunologically screened with rabbit anti-caldesmon antibodies obtained by the immunization of a rabbit with 35 kDa fragment of chicken caldesmon obtained by the digestion with α-chymotrypsin (J. Biochem., 102, 1065-1073, 1987) and with F(ab')₂ fragment of goat anti-rabbit immunoglobulin antibody labelled with alkaline phosphatase as the second antibody. Clones that were stained with nitro blue tetrazolium (NBT) and 5-bromo-4-chloro-3-indolylphosphate (BCIP) were positive. Several positive signals were obtained from among the 250,000 clones. The plaques that corresponded to the positive signals were cut out together with agar and suspended in 500 μl of SM solution (50 mM Tris-HCl, 100 mM NaCl, and 10 mM MgSO₄, pH 7.5), plated after being diluted appropriately, and screened again as described above, by which procedure it was possible to isolate four phage clones independently.

1-3. Preparation of recombinant λgt11 DNA

Next, 10⁵ of the cloned phages were plated, and the plates were kept first at 42° C. for 3 hours and then at 37° C. overnight. After this, 15 ml of SM solution and a few drops of chloroform were added and the plates were kept at room temperature for 30 minutes. The upper layer of soft agar with the SM solution was removed and put into a centrifuge tube, and centrifuged at 3000 r.p.m for 15 minutes. The supernatant was added 40% (w/v) polyethylene glycol 6000, so that the final concentration of the polyethylene glycol was 10%, and mixed throughly. The mixture was kept at 4° C. for 1 hour and centrifuged at 3000 r.p.m. for 15 minutes. The supernatant was discarded and the precipitate was suspended with SM solution and then left at 4° C. (below, this method is referred to as the plate lysate method). By this procedure, a phage solution with the titer of 1×10¹⁰ pfu/ml was obtained.

E. coli Y1090 cells that had been cultured overnight in L medium were harvested and suspended in 10 mM MgSO₄. The cell suspension was mixed with the phage solution mentioned above at multiplicity of infection (moi) of 0.01, and the mixture were left at 37° C. for 15 minutes. The mixture of phages and E. coli cells was used to inoculate L medium that contained 10 mM MgSO₄, and continued to be cultured at 37° C. until a considerable amount of bacterial debris remained after lysis. To the culture, sodium chloride was added to the final concentration of 0.5M, chloroform was added to the final concentration of 0.5%, and the mixture was stirred at 37° C. for 15 minutes. The supernatant obtained by centrifugation was mixed with polyethylene glycol 6000 added to the final concentration of 10% (w/v), and the mixture was left overnight at 4° C. The phage precipitate obtained by centrifugation was dissolved in TM solution (50 mM Tris-HCl and 10 mM MgSO₄, pH 7.8), and by the method of glycerol step-gradient ultracentrifugation (T. Maniastis et al., Molecular cloning: A laboratory manual, pp. 83-84, Cold Spring Harbor Laboratory, 1982), the phages were purified.

The phages obtained were suspended in TM solution, and DNase I and RNaseA were added to the suspension, which was then kept at 37° C. for 30 minutes before the addition of EDTA, Proteinase K (Sigma), and SDS to the concentrations of 20 mM, 50 μg/ml, and 0.5%, respectively. After then the mixture was kept at 65° C. for 1 hour. Phenol extraction was done, followed by extraction with diethyl ether, and to the aqueous layer, a 1/10 volume of 5M sodium chloride and two volumes of ice-cold ethanol were added, which caused precipitation of the DNA (below, this step is referred to as ethanol precipitation). The mixture was centrifuged and the precipitated DNA was collected, after which it was washed in 70% ethanol and dried before being dissolved in 100 μl of TE solution (10 mM Tris-HCl and 1 mM EDTA, pH 8.0).

1-4. Identification of DNA sequence of the inserted fragment

The recombinant λgt11 DNA prepared as described above was digested with EcoRI, and the inserted fragments were isolated and purified. Then they were cloned at the EcoRI site of M13mp18RF DNA. The recombinant M13mp18RF DNA was digested with SalI and SphI, and by use of exonuclease III (Course in Biochemical Experimentation 1, Methods in genetic research I, pp. 186-200, 1986), a mutant was constructed that lacked various length of sequence at the 5'-side of the SalI end. Another deletion mutant was prepared that was of recombinant M13mp18RF DNA that had the fragment insertion in the opposite orientation. Then E. coli JM109 cells were transformed with each mutant DNA derived from the recombinant M13mp18 and single-stranded DNA was isolated from the M13mp18 derivatives. Its DNA sequence was identified by the dideoxy-mediated chain-termination method, and the amino acid sequence was deduced from the DNA sequence.

Restriction maps of these clones are shown in FIG. 1. The ATG and TGA shown in the FIG. 1 are the initiation codon and the termination codon for translation, respectively.

The results of identification of the DNA sequence suggested that HeLa cells had two kinds of caldesmon. This was checked by use of the PCR. To do this, with primer 6 (shown in FIG. 1 as P6) indicated as SEQ ID NO:29 in the Sequence Listing the first strand of cDNA was synthesized. With both primer 5 (shown in FIG. 1 as P5) indicated as SEQ ID NO:30 of the Sequence Listing and primer 4 (shown in FIG. 1 as P4) indicated as SEQ ID NO:31 in the Sequence Listing as primer 4, the PCR was done, and two kinds of PCR product A (A-1 and A-2) were obtained. Then with primer 5 and primer 28 (shown in FIG. 1 as P28) indicated as SEQ ID NO:32, the PCR was done, and two kinds of PCR product B (B-1 and B-2) were obtained. Restriction maps of PCR products A (A-1 and A-2) and B (B-1 and B-2) are shown in FIG. 1. The DNA sequences of these PCR products were identified by the dideoxy method, and it was confirmed that there were two kinds of caldesmon. The caldesmon of higher molecular weight was composed of the 558 amino acid residues shown as SEQ ID NO:6 in the Sequence Listing, and the caldesmon of lower molecular weight was composed of the 532 amino acid residues shown as SEQ ID NO:5 in the Sequence Listing and was exactly the same as that of higher molecular weight except that it lacked amino acid residues 202 to 227 of the larger caldesmon. The caldesmon of higher molecular weight was named type I, and the caldesmon of lower molecular weight was named type II. The DNA sequence of type I is that shown as SEQ ID NO:12 in the Sequence Listing, and the DNA sequence of type II is that shown as SEQ ID NO:11. Their cDNA sequences are shown as SEQ ID NOS:14 and 13, respectively, in the Sequence Listing.

Example 2 Construction of expression vectors for types I and II

2-1. Construction of a plasmid that coded for type I caldesmon.

A cDNA fragment of about 800 bp that coded for the N-terminal region of type I caldesmon was obtained by EcoRI digestion of λgt11HS3CaDN1, and cloned at the EcoRI site of M13mp18RF DNA, giving M13mp18HS3CaDN1. This M13mp18HS3CaDN1 was digested with Nsp(7524)I, blunt-ended with T4 DNA polymerase, and digested with EcoRI, giving cDNA fragments of about 790 bp. This cDNA fragment was cloned in plasmid pTV118N which was digested with NcoI, treated with Klenow fragment (large fragment of E. coli DNA polymerase I), and digested with EcoRI, giving pTV118NHS3CaDN1.

cDNA fragments about 1.3 kbp long that coded for the C-terminal region of caldesmon were obtained by EcoRI digestion of λgt11HS3CaD25, and cloned at the EcoRI site of M13mp18RF DNA, giving M13mp18HS3CaDC. Then M13mp18HS3CaDC was digested with EcoRI and SacI, giving fragments of cDNA about 940 bp long, which were into pTV118NHS3CaDN1 between the EcoRI and SacI sites , giving plasmid pTV118NHS3CaD1, which coded for type I caldesmon.

Cells of E. coli JM109 into which pTV118NHS3CaD1 had been introduced were designated E. coli JM109/pTV118NHS3CaD1, and deposited at the Fermentation Research Institute of the Agency of Industrial Science and Technology, Japan, under FERM BP-3673.

2-2. Construction of a plasmid that coded for type II caldesmon

T4 DNA polymerase was used to make blunt ends on PCR product A-2, and the product was cloned at the SmaI site of M13mp18RF DNA so as to be in the same direction in the PCR product A-2 as in the M13mp18 lacZ gene. Then cloned M13mp18PCRA-2 with the plasmid A-2 as an insertion was constructed. M13mp18PCRA-2 was digested with Nsp(7524)I, and after its ends were made blunt with T4 DNA polymerase, it was digested at the multicloning site that originated from M13mp18 with XbaI, and cDNA fragments about 1.6 kbp long were obtained. These were cloned into pTV118N that had been digested with NcoI, treated with Klenow fragment, and then digested with XbaI, giving pTV118NHS3CaD2 PCRA-2. This was digested with EcoRI and HindIII, and ligated with the cDNA fragments about 940 bp long that code for the C-terminal region of the caldesmon obtained by digestion of pTV118NHS3CaD1 with EcoRI and HindIII. The product was designated plasmid pTV118NHS3CaD2, which codes for type II caldesmon.

E. coli cells into which pTV118NHS3CaD2 had been introduced were designated E. coli JM109/pTV118NHS3CaD2, and deposited at the Fermentation Research Institute of the Agency of Industrial Science and Technology, Japan, under FERM P-12013.

Example 3 Construction of a plasmid that expresses human caldesmon polypeptide

3-1. Construction of a plasmid that codes for 312AA

A primer with SEQ ID NO:15 of the Sequence Listing and with the NcoI recognition sequence at its 5'-end and a primer with SEQ ID NO:16 and with the SacI recognition sequence at its 5'-end were synthesized with a DNA synthesizer and purified. Said primers were used together with pTV118NHS3CaD1 as the template in the PCR to amplify DNA that had the DNA sequence shown as SEQ ID NO:10 in the Sequence Listing within its sequence.

Next, this amplification product was digested with NcoI and SacI and extracted with buffered phenol, so that its enzyme activity was lost. For the removal of an excess of dNTP in the PCR reaction mixture, and DNA concentration, the DNA was caused to precipitate with ammonium sulfate and isopropyl alcohol. The precipitated DNA was dissolved with TE solution (10 mM Tris-HCl, pH 8.0, and 1 mM EDTA), and ligated with pTV118N that had been digested with both NcoI and SacI, giving the plasmid pTHCD247, which codes for 312AA.

E. coli JM109 cells into which pTHCD247 had been introduced were designated E. coli JM109/pTHCD247, and deposited at the Fermentation Research Institute of the Agency of Industrial Science and Technology, Japan, under FERM BP-3671.

3-2. Construction of a plasmid that codes for 118AA

A primer with SEQ ID NO:19 of the Sequence Listing and with the NcoI recognition sequence at its 5'-end and a primer with SEQ ID NO:18 and with the EcoRI recognition sequence at its 5'-end were synthesized with a DNA synthesizer and purified. Said primers were used together with pTV118NHS3CaD1 as the template in the PCR to amplify DNA that had the DNA sequence shown as SEQ ID NO:8 in the Sequence Listing within its sequence.

Next, this amplification product was digested with NcoI and EcoRI, and treated as in section 3-1 above before being ligated with pTV118N that had been digested with both NcoI and EcoRI, giving the plasmid pTHCD443, which codes for 118AA.

E. coli JM109 cells into which pTHCD443 had been introduced were designated E. coli JM109/pTHCD443, and deposited at the Fermentation Research Institute of the Agency of Industrial Science and Technology, Japan, under FERM BP-3672.

3-3. Construction of a plasmid that codes for 122AA

A primer with SEQ ID NO:17 of the Sequence Listing and with the NcoI recognition sequence and the factor Xa recognition sequence at its 5'-end was synthesized with a DNA synthesizer and purified. Next, said primer and the primer described above as having SEQ ID NO:18 were used together with pTV118NHS3CaD1 as the template in the PCR to amplify DNA that had the DNA sequence shown as SEQ ID NO:9 in the Sequence Listing within its sequence.

Next, this amplification product was digested with NcoI and EcoRI, and ligated with pTV118N, which had also been digested both NcoI and EcoRI, giving a plasmid that coded for 122AA. Said plasmid was designated pTHCDXa443, and cells of E. coli JM109 into which the plasmid was introduced were named E. coli JM109/pTHCDXa443.

3-4. Construction of a plasmid that codes for 90AA

A primer with SEQ ID NO:24 of the Sequence Listing and with the NcoI recognition sequence at its 5'-end and a primer with SEQ ID NO:21 and the EcoRI recognition sequence at its 5'-end were synthesized with a DNA synthesizer and purified. Said primers were used together with pTV118NHS3CaD1 as the template in the PCR to amplify DNA that had the DNA sequence shown as SEQ ID NO:26 in the Sequence Listing within it. Next, this amplification product was digested with NcoI and EcoRI, and ligated with pTV118N that had been digested with NcoI and EcoRI, giving the plasmid pTHCD451, which codes for 90AA.

E. coli JM109 cells into which pTHCD451 had been introduced were designated E. coli JM109/pTHCD451.

3-5. Construction of a plasmid that codes for 94AA

A primer with SEQ ID NO:20 of the Sequence Listing and with the NcoI recognition sequence and the factor Xa recognition sequence at its 5'-end was synthesized with a DNA synthesizer and purified. Said primer and the primer described above as having SEQ ID NO:21 were used together with pTV118NHS3CaD1 as the template in the PCR to amplify DNA that had the DNA sequence shown as SEQ ID NO:23 in the Sequence Listing within its sequence.

Next, this amplification product was digested with NcoI and EcoRI, and ligated with pTV118N, which had also been digested both NcoI and EcoRI, giving a plasmid that coded for 94AA. Said plasmid was designated pTHCDXa451, and cells of E. coli JM109 into which the plasmid was introduced were named E. coli JM109/pTHCDXa451.

Example 4 Expression of human caldesmon polypeptide in E. coli

4-1. Expression of type I in E. coli

Cells of E. coli JM109 which carried pTV118NHS3CaD1 (FERM BP-3673) were used to inoculate 5 ml of L medium that contained 50 μg/ml ampicillin, and the culture was incubated with shaking overnight at 37° C. This seed culture was used to inoculate 500 ml of L medium with the same concentration of ampicillin, and the culture was incubated with shaking overnight at 37° C. When the cells reached the mid-logarithmic stage of growth, IPTG was added to the final concentration of 2 mM, and culture was continued for 17 hours more before the cells were harvested.

The cells were suspended in E solution (0.3M KCl, 0.5 mM EGTA, 0.5 mM MgCl₂, 0.5 mM dithiothreitol (DTT), 0.3 mM PMSF, and 50 mM Tris-HCl, pH 7.0), and the suspension was sonicated to disrupt the cells. The suspension was centrifuged and the supernatant was incubated at 95° C. for 5 minutes and then cooled. This was centrifuged and the precipitate was removed. The supernatant was dialyzed against C1 solution (120 mM NaCl, 0.1 mM EGTA, 0.2 mM DTT, and 10 mM Tris-HCl, pH 7.3). The inner solution during dialysis was passed through a column of DEAE Toyopearl 650M equilibrated with C1 solution. To the pass-through fraction, calcium chloride was added to the final concentration of 5 mM, the mixture was passed through a column of Sepharose 4B (Pharmacia LKB) coupled with calmodulin which equilibrated with C2 solution (70 mM NaCl, 0.2 mM CaCl₂, 0.1 mM DTT, and 10 mM Tris-HCl, pH 7.5) as described elsewhere (Journal of Biochemistry, 102, 1065-1073, 1987), and type I purified was extracted with C2 solution that contained 1.2 mM EGTA and purified.

4-2. Expression of type II in E. coli

Cells of E. coli JM109 which carried pTV118NHS3CaD2 (FERM P-12013) were cultured under the same conditions as in the example of 4-1 above, extracted, heated, put on a column of resin for affinity chromatography with calmodulin, and purified, giving type II.

4-3. Expression of 312AA, 122AA, 118AA, 94AA, and 90AA in E. coli

Cells of E. coli JM109/pTHCD247 (FERM BP-3671), of E. coli JM109/pTHCDXa443, of E. coli JM109/pTHCD443 (FERM BP-3672), of E. coli JM109/pTHCDXa451 and of E. coli JM109/pTHCD451 were cultured under the same conditions as in the example of 4-1 above, and purification gave 312AA, 122AA, 118AA, 94AA, and 90AA.

4-4. Preparation of 116AA and 88AA

122AA and 94AA were treated with restriction proteinase factor Xa (Takara Shuzo Co., Ltd.) for 5 hours at 37° C., and a column of calmodulin-Sepharose was used to purify 116AA and 88AA from the digest.

Example 5 Assay of physiological activity

5-1. Assay of calmodulin-binding activity and actin-binding activity

For the assay of calmodulin-binding activity, polypeptides purified as described above were mixed with 10 volumes of CaB buffer (10 mM Tris-HCl, pH 7.5, 100 mM KCl, 0.1 mM DTT, and 0.2 mM CaCl₂), and put on a column of Sepharose 4B (Pharmacia) coupled with calmodulin which equilibrated with the same buffer. This allowed binding of the polypeptide with calmodulin. After the column was washed with a large amount of the same buffer, the bound polypeptide was eluted from the column with the same buffer, except that 1 mM EGTA was added to the buffer instead of CaCl₂. A portion of each fraction was put on an SDS-polyacrylamide gel and electrophoresed, and the presence of the polypeptide in question was confirmed. For the assay of actin-binding activity, the co-precipitation method (BBRC, 132, 645-651, 1985, and Journal of Biochemistry, 102, 1065-1073, 1987) was used.

To 100 μl of a 2 mg/ml solution of monomeric actin prepared by the Staub method (Journal of Biological Chemistry, 188, 559 (1959)), 20 μl of 10 mM ATP, 40 μl of 5× buffer (50 mM Tris-HCl, pH 7.5, 500 mM KCl, 1 mM CaCl₂, and 0.5 mM DTT), 20 μl purified polypeptide, and water were added for a total volume of 196 μl, and then 4 μl of 100 mM MgCl₂ was added. The mixture was left at room temperature for 1 hour. By the addition of MgCl₂, the monomeric actin was polymerized and became filamentous actin. After the polymerization, the mixture was centrifuged at 200,000×g for 30 minutes. The monomeric actin remained in the supernatant, whereas the filamentous actin and proteins bound with it (the polypeptide of this invention) precipitated together. The supernatant and the precipitate were separately electrophoresed on SDS-polyacrylamide gels, and the binding of the polypeptide with actin was confirmed.

The results shown in the Table 1 above, were that the polypeptide that contained 116AA in its sequence had both kinds of activity.

5-2. Assay of tropomyosin-binding activity and inhibition of actomyosin ATPase activity

The ability of type I, type II, 312AA, 122AA, 118AA, and 116AA which have the activities described in 5-1, to bind with tropomyosin and to inhibit actomyosin ATPase activity was measured.

For the assay of tropomyosin-binding activity, an affinity column with tropomyosin was prepared by the coupling of tropomyosin with Sepharose 4B (Pharmacia) activated with CNBr. The column was equilibrated with TMB buffer (10 mM Tris-HCl, pH 7.0, 2 mM MgCl₂, and 0.5 mM DTT), and a solution of a purified polypeptide diluted. 10 times in the same buffer was put on the column to allow binding of the polypeptide to the column. The column was washed thoroughly with the same buffer, and after substances that had bound nonspecifically were removed in this way, elution was done with slowly increasing concentrations of potassium chloride. A portion of each fraction was put on an SDS-polyacrylamide gel and electrophoresed, and the polypeptide in question was found in the fractions that had been eluted with potassium chloride at the concentrations of 40 to 60 mM.

For the assay of the inhibition of actomyosin-ATPase by the polypeptide of this invention, first, phosphorylated myosin (final concentration, 0.2 mg/ml), actin (final concentration, 0.1 mg/ml), and tropomyosin (final concentration, 30 μg/ml) were added to a buffer that contained 20 mM imidazole-HCl (pH 7.2), 0.1 mM DTT, 100 mM KCl, 2 mM MgCl₂, 1 mM ATP, and 0.1 mM CaCl₂. Then the purified polypeptide was added and the mixture was kept at 30° C. for 5 minutes. After this reaction time, trichloroacetic acid was added to the final concentration of 10% to stop the reaction. The reaction mixture was centrifuged, and the amount of free phosphate in the supernatant was assayed by the method of Youngburg (BBRC, 132, 645-651, 1985). The inhibition caused by the polypeptide was found from the difference in the amount of free phosphate related by ATPase in the presence and absence of the polypeptide. Both of these activities were found in type I, type II, 312AA, 122AA, 118AA, and 116AA. The functional unit of human caldesmon polypeptide was found to be 116AA.

Results of the invention

As shown by the results described above, this invention provides the complete amino acid sequence of human caldesmon, and the complete DNA sequence that codes for human caldesmon, and it identifies the functional unit of caldesmon activity. These genes and the polypeptides are useful in the fields of biochemistry, medicine, including diagnosis, and the like.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 32                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        ArgLeuGluGlnTyrThrSerAlaIleGluGlyThrLysSerAla                                  151015                                                                         LysProThrLysProAlaAlaSerAspLeuProValProAlaGlu                                  202530                                                                         GlyValArgAsnIleLysSerMetTrpGluLysGlyAsnValPhe                                  354045                                                                         SerSerProThrAlaAlaGlyThrProAsnLysGluThrAlaGly                                  505560                                                                         LeuLysValGlyValSerSerArgIleAsnGluTrpLeuThrLys                                  657075                                                                         ThrProAspGlyAsnLysSerProAlaProLysProSerAspLeu                                  808590                                                                         ArgProGlyAspValSerSerLysArgAsnLeuTrpGluLysGln                                  95100105                                                                       SerValAspLysValThrSerProThrLysVal                                              110115                                                                         (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetAlaArgLeuGluGlnTyrThrSerAlaIleGluGlyThrLys                                  151015                                                                         SerAlaLysProThrLysProAlaAlaSerAspLeuProValPro                                  202530                                                                         AlaGluGlyValArgAsnIleLysSerMetTrpGluLysGlyAsn                                  354045                                                                         ValPheSerSerProThrAlaAlaGlyThrProAsnLysGluThr                                  505560                                                                         AlaGlyLeuLysValGlyValSerSerArgIleAsnGluTrpLeu                                  657075                                                                         ThrLysThrProAspGlyAsnLysSerProAlaProLysProSer                                  808590                                                                         AspLeuArgProGlyAspValSerSerLysArgAsnLeuTrpGlu                                  95100105                                                                       LysGlnSerValAspLysValThrSerProThrLysVal                                        110115                                                                         (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 122 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        MetAlaIleGluGlyArgArgLeuGluGlnTyrThrSerAlaIle                                  151015                                                                         GluGlyThrLysSerAlaLysProThrLysProAlaAlaSerAsp                                  202530                                                                         LeuProValProAlaGluGlyValArgAsnIleLysSerMetTrp                                  354045                                                                         GluLysGlyAsnValPheSerSerProThrAlaAlaGlyThrPro                                  505560                                                                         AsnLysGluThrAlaGlyLeuLysValGlyValSerSerArgIle                                  657075                                                                         AsnGluTrpLeuThrLysThrProAspGlyAsnLysSerProAla                                  808590                                                                         ProLysProSerAspLeuArgProGlyAspValSerSerLysArg                                  95100105                                                                       AsnLeuTrpGluLysGlnSerValAspLysValThrSerProThr                                  110115120                                                                      LysVal                                                                         (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 312 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        MetAspArgLysLysGlyPheThrGluValLysSerGlnAsnGly                                  151015                                                                         GluPheMetThrHisLysLeuLysHisThrGluAsnThrPheSer                                  202530                                                                         ArgProGlyGlyArgAlaSerValAspThrLysGluAlaGluGly                                  354045                                                                         AlaProGlnValGluAlaGlyLysArgLeuGluGluLeuArgArg                                  505560                                                                         ArgArgGlyGluThrGluSerGluGluPheGluLysLeuLysGln                                  657075                                                                         LysGlnGlnGluAlaAlaLeuGluLeuGluGluLeuLysLysLys                                  808590                                                                         ArgGluGluArgArgLysValLeuGluGluGluGluGlnArgArg                                  95100105                                                                       LysGlnGluGluAlaAspArgLysLeuArgGluGluGluGluLys                                  110115120                                                                      ArgArgLeuLysGluGluIleGluArgArgArgAlaGluAlaAla                                  125130135                                                                      GluLysArgGlnLysMetProGluAspGlyLeuSerAspAspLys                                  140145150                                                                      LysProPheLysCysPheThrProLysGlySerSerLeuLysIle                                  155160165                                                                      GluGluArgAlaGluPheLeuAsnLysSerValGlnLysSerSer                                  170175180                                                                      GlyValLysSerThrHisGlnAlaAlaIleValSerLysIleAsp                                  185190195                                                                      SerArgLeuGluGlnTyrThrSerAlaIleGluGlyThrLysSer                                  200205210                                                                      AlaLysProThrLysProAlaAlaSerAspLeuProValProAla                                  215220225                                                                      GluGlyValArgAsnIleLysSerMetTrpGluLysGlyAsnVal                                  230235240                                                                      PheSerSerProThrAlaAlaGlyThrProAsnLysGluThrAla                                  245250255                                                                      GlyLeuLysValGlyValSerSerArgIleAsnGluTrpLeuThr                                  260265270                                                                      LysThrProAspGlyAsnLysSerProAlaProLysProSerAsp                                  275280285                                                                      LeuArgProGlyAspValSerSerLysArgAsnLeuTrpGluLys                                  290295300                                                                      GlnSerValAspLysValThrSerProThrLysVal                                           305310                                                                         (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 532 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        MetLeuGlyGlySerGlySerHisGlyArgArgSerLeuAlaAla                                  151015                                                                         LeuSerGlnIleAlaTyrGlnArgAsnAspAspAspGluGluGlu                                  202530                                                                         AlaAlaArgGluArgArgArgArgAlaArgGlnGluArgLeuArg                                  354045                                                                         GlnLysGlnGluGluGluSerLeuGlyGlnValThrAspGlnVal                                  505560                                                                         GluValAsnAlaGlnAsnSerValProAspGluGluAlaLysThr                                  657075                                                                         ThrThrThrAsnThrGlnValGluGlyAspAspGluAlaAlaPhe                                  808590                                                                         LeuGluArgLeuAlaArgArgGluGluArgArgGlnLysArgLeu                                  95100105                                                                       GlnGluAlaLeuGluArgGlnLysGluPheAspProThrIleThr                                  110115120                                                                      AspAlaSerLeuSerLeuProSerArgArgMetGlnAsnAspThr                                  125130135                                                                      AlaGluAsnGluThrThrGluLysGluGluLysSerGluSerArg                                  140145150                                                                      GlnGluArgTyrGluIleGluGluThrGluThrValThrLysSer                                  155160165                                                                      TyrGlnLysAsnAspTrpArgAspAlaGluGluAsnLysLysGlu                                  170175180                                                                      AspLysGluLysGluGluGluGluGluGluLysProLysArgGly                                  185190195                                                                      SerIleGlyGluAsnGlnIleLysAspGluLysIleLysLysAsp                                  200205210                                                                      LysGluProLysGluGluValLysSerPheMetAspArgLysLys                                  215220225                                                                      GlyPheThrGluValLysSerGlnAsnGlyGluPheMetThrHis                                  230235240                                                                      LysLeuLysHisThrGluAsnThrPheSerArgProGlyGlyArg                                  245250255                                                                      AlaSerValAspThrLysGluAlaGluGlyAlaProGlnValGlu                                  260265270                                                                      AlaGlyLysArgLeuGluGluLeuArgArgArgArgGlyGluThr                                  275280285                                                                      GluSerGluGluPheGluLysLeuLysGlnLysGlnGlnGluAla                                  290295300                                                                      AlaLeuGluLeuGluGluLeuLysLysLysArgGluGluArgArg                                  305310315                                                                      LysValLeuGluGluGluGluGlnArgArgLysGlnGluGluAla                                  320325330                                                                      AspArgLysLeuArgGluGluGluGluLysArgArgLeuLysGlu                                  335340345                                                                      GluIleGluArgArgArgAlaGluAlaAlaGluLysArgGlnLys                                  350355360                                                                      MetProGluAspGlyLeuSerAspAspLysLysProPheLysCys                                  365370375                                                                      PheThrProLysGlySerSerLeuLysIleGluGluArgAlaGlu                                  380385390                                                                      PheLeuAsnLysSerValGlnLysSerSerGlyValLysSerThr                                  395400405                                                                      HisGlnAlaAlaIleValSerLysIleAspSerArgLeuGluGln                                  410415420                                                                      TyrThrSerAlaIleGluGlyThrLysSerAlaLysProThrLys                                  425430435                                                                      ProAlaAlaSerAspLeuProValProAlaGluGlyValArgAsn                                  440445450                                                                      IleLysSerMetTrpGluLysGlyAsnValPheSerSerProThr                                  455460465                                                                      AlaAlaGlyThrProAsnLysGluThrAlaGlyLeuLysValGly                                  470475480                                                                      ValSerSerArgIleAsnGluTrpLeuThrLysThrProAspGly                                  485490495                                                                      AsnLysSerProAlaProLysProSerAspLeuArgProGlyAsp                                  500505510                                                                      ValSerSerLysArgAsnLeuTrpGluLysGlnSerValAspLys                                  515520525                                                                      ValThrSerProThrLysVal                                                          530                                                                            (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 558 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        MetLeuGlyGlySerGlySerHisGlyArgArgSerLeuAlaAla                                  151015                                                                         LeuSerGlnIleAlaTyrGlnArgAsnAspAspAspGluGluGlu                                  202530                                                                         AlaAlaArgGluArgArgArgArgAlaArgGlnGluArgLeuArg                                  354045                                                                         GlnLysGlnGluGluGluSerLeuGlyGlnValThrAspGlnVal                                  505560                                                                         GluValAsnAlaGlnAsnSerValProAspGluGluAlaLysThr                                  657075                                                                         ThrThrThrAsnThrGlnValGluGlyAspAspGluAlaAlaPhe                                  808590                                                                         LeuGluArgLeuAlaArgArgGluGluArgArgGlnLysArgLeu                                  95100105                                                                       GlnGluAlaLeuGluArgGlnLysGluPheAspProThrIleThr                                  110115120                                                                      AspAlaSerLeuSerLeuProSerArgArgMetGlnAsnAspThr                                  125130135                                                                      AlaGluAsnGluThrThrGluLysGluGluLysSerGluSerArg                                  140145150                                                                      GlnGluArgTyrGluIleGluGluThrGluThrValThrLysSer                                  155160165                                                                      TyrGlnLysAsnAspTrpArgAspAlaGluGluAsnLysLysGlu                                  170175180                                                                      AspLysGluLysGluGluGluGluGluGluLysProLysArgGly                                  185190195                                                                      SerIleGlyGluAsnGlnGlyGluGluLysGlyThrLysValGln                                  200205210                                                                      AlaLysArgGluLysLeuGlnGluAspLysProThrPheLysLys                                  215220225                                                                      GluGluIleLysAspGluLysIleLysLysAspLysGluProLys                                  230235240                                                                      GluGluValLysSerPheMetAspArgLysLysGlyPheThrGlu                                  245250255                                                                      ValLysSerGlnAsnGlyGluPheMetThrHisLysLeuLysHis                                  260265270                                                                      ThrGluAsnThrPheSerArgProGlyGlyArgAlaSerValAsp                                  275280285                                                                      ThrLysGluAlaGluGlyAlaProGlnValGluAlaGlyLysArg                                  290295300                                                                      LeuGluGluLeuArgArgArgArgGlyGluThrGluSerGluGlu                                  305310315                                                                      PheGluLysLeuLysGlnLysGlnGlnGluAlaAlaLeuGluLeu                                  320325330                                                                      GluGluLeuLysLysLysArgGluGluArgArgLysValLeuGlu                                  335340345                                                                      GluGluGluGlnArgArgLysGlnGluGluAlaAspArgLysLeu                                  350355360                                                                      ArgGluGluGluGluLysArgArgLeuLysGluGluIleGluArg                                  365370375                                                                      ArgArgAlaGluAlaAlaGluLysArgGlnLysMetProGluAsp                                  380385390                                                                      GlyLeuSerAspAspLysLysProPheLysCysPheThrProLys                                  395400405                                                                      GlySerSerLeuLysIleGluGluArgAlaGluPheLeuAsnLys                                  410415420                                                                      SerValGlnLysSerSerGlyValLysSerThrHisGlnAlaAla                                  425430435                                                                      IleValSerLysIleAspSerArgLeuGluGlnTyrThrSerAla                                  440445450                                                                      IleGluGlyThrLysSerAlaLysProThrLysProAlaAlaSer                                  455460465                                                                      AspLeuProValProAlaGluGlyValArgAsnIleLysSerMet                                  470475480                                                                      TrpGluLysGlyAsnValPheSerSerProThrAlaAlaGlyThr                                  485490495                                                                      ProAsnLysGluThrAlaGlyLeuLysValGlyValSerSerArg                                  500505510                                                                      IleAsnGluTrpLeuThrLysThrProAspGlyAsnLysSerPro                                  515520525                                                                      AlaProLysProSerAspLeuArgProGlyAspValSerSerLys                                  530535540                                                                      ArgAsnLeuTrpGluLysGlnSerValAspLysValThrSerPro                                  545550555                                                                      ThrLysVal                                                                      (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 348 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        AGACTGGAGCAGTATACCAGTGCAATTGAGGGAACAAAAAGCGCAAAACCTACAAAGCCG60                 GCAGCCTCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATGTGGGAG120                AAAGGGAATGTGTTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACTGCTGGC180                TTGAAGGTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGATGGAAAC240                AAGTCACCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCAGCAAGCGGAAC300                CTCTGGGAAAAGCAATCTGTGGATAAGGTCACTTCCCCCACTAAGGTT348                            (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 357 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        ATGGCCAGACTGGAGCAGTATACCAGTGCAATTGAGGGAACAAAAAGCGCAAAACCTACA60                 AAGCCGGCAGCCTCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATG120                TGGGAGAAAGGGAATGTGTTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACT180                GCTGGCTTGAAGGTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGAT240                GGAAACAAGTCACCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCAGCAAG300                CGGAACCTCTGGGAAAAGCAATCTGTGGATAAGGTCACTTCCCCCACTAAGGTTTGA357                   (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 369 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        ATGGCCATCGAGGGTAGGAGACTGGAGCAGTATACCAGTGCAATTGAGGGAACAAAAAGC60                 GCAAAACCTACAAAGCCGGCAGCCTCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAAC120                ATCAAGAGTATGTGGGAGAAAGGGAATGTGTTTTCATCCCCCACTGCAGCAGGCACACCA180                AATAAGGAAACTGCTGGCTTGAAGGTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACT240                AAAACCCCAGATGGAAACAAGTCACCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGAC300                GTATCCAGCAAGCGGAACCTCTGGGAAAAGCAATCTGTGGATAAGGTCACTTCCCCCACT360                AAGGTTTGA369                                                                   (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 939 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       ATGGATCGAAAGAAGGGATTTACAGAAGTTAAGTCGCAGAATGGAGAATTCATGACCCAC60                 AAACTTAAACATACTGAGAATACTTTCAGCCGCCCTGGAGGGAGGGCCAGCGTGGACACC120                AAGGAGGCTGAGGGCGCCCCCCAGGTGGAAGCCGGCAAAAGGCTGGAGGAGCTTCGTCGT180                CGTCGCGGGGAGACCGAGAGCGAAGAGTTCGAGAAGCTCAAACAGAAGCAGCAGGAGGCG240                GCTTTGGAGCTGGAGGAACTCAAGAAAAAGAGGGAGGAGAGAAGGAAGGTCCTGGAGGAG300                GAAGAGCAGAGGAGGAAGCAGGAGGAAGCCGATCGAAAACTCAGAGAGGAGGAAGAGAAG360                AGGAGGCTAAAGGAAGAGATTGAAAGGCGAAGAGCAGAAGCTGCTGAGAAACGCCAGAAG420                ATGCCAGAAGATGGCTTGTCAGATGACAAGAAACCATTCAAGTGTTTCACTCCTAAAGGT480                TCATCTCTCAAGATAGAAGAGCGAGCAGAATTTTTGAATAAGTCTGTGCAGAAAAGCAGT540                GGTGTCAAATCGACCCATCAAGCAGCAATAGTCTCCAAGATTGACAGCAGACTGGAGCAG600                TATACCAGTGCAATTGAGGGAACAAAAAGCGCAAAACCTACAAAGCCGGCAGCCTCGGAT660                CTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATGTGGGAGAAAGGGAATGTG720                TTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACTGCTGGCTTGAAGGTAGGG780                GTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGATGGAAACAAGTCACCTGCT840                CCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCAGCAAGCGGAACCTCTGGGAAAAG900                CAATCTGTGGATAAGGTCACTTCCCCCACTAAGGTTTGA939                                     (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1599 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       ATGCTGGGTGGATCCGGATCGCATGGAAGACGCAGCCTGGCCGCGCTCTCCCAAATCGCC60                 TACCAGAGGAATGACGATGATGAAGAGGAGGCAGCCCGGGAACGGCGCCGCCGAGCCCGA120                CAGGAACGGCTGCGGCAGAAGCAGGAGGAAGAATCCTTGGGACAGGTGACCGACCAGGTG180                GAGGTGAATGCCCAGAACAGTGTGCCTGACGAGGAGGCCAAGACAACCACCACAAACACT240                CAAGTGGAAGGGGATGATGAGGCCGCATTCCTGGAGCGCCTGGCTCGGCGTGAGGAAAGA300                CGCCAAAAACGCCTTCAGGAGGCTCTGGAGCGGCAGAAGGAGTTCGACCCAACAATAACA360                GATGCAAGTCTGTCGCTCCCAAGCAGAAGAATGCAAAATGACACAGCAGAAAATGAAACT420                ACCGAGAAGGAAGAAAAAAGTGAAAGTCGCCAAGAAAGATACGAGATAGAGGAAACAGAA480                ACAGTCACCAAGTCCTACCAGAAGAATGATTGGAGGGATGCTGAAGAAAACAAGAAAGAA540                GACAAGGAAAAGGAGGAGGAGGAAGAGGAGAAGCCAAAGCGAGGGAGCATTGGAGAAAAT600                CAGATCAAAGATGAAAAGATTAAAAAGGACAAAGAACCCAAAGAAGAAGTTAAGAGCTTC660                ATGGATCGAAAGAAGGGATTTACAGAAGTTAAGTCGCAGAATGGAGAATTCATGACCCAC720                AAACTTAAACATACTGAGAATACTTTCAGCCGCCCTGGAGGGAGGGCCAGCGTGGACACC780                AAGGAGGCTGAGGGCGCCCCCCAGGTGGAAGCCGGCAAAAGGCTGGAGGAGCTTCGTCGT840                CGTCGCGGGGAGACCGAGAGCGAAGAGTTCGAGAAGCTCAAACAGAAGCAGCAGGAGGCG900                GCTTTGGAGCTGGAGGAACTCAAGAAAAAGAGGGAGGAGAGAAGGAAGGTCCTGGAGGAG960                GAAGAGCAGAGGAGGAAGCAGGAGGAAGCCGATCGAAAACTCAGAGAGGAGGAAGAGAAG1020               AGGAGGCTAAAGGAAGAGATTGAAAGGCGAAGAGCAGAAGCTGCTGAGAAACGCCAGAAG1080               ATGCCAGAAGATGGCTTGTCAGATGACAAGAAACCATTCAAGTGTTTCACTCCTAAAGGT1140               TCATCTCTCAAGATAGAAGAGCGAGCAGAATTTTTGAATAAGTCTGTGCAGAAAAGCAGT1200               GGTGTCAAATCGACCCATCAAGCAGCAATAGTCTCCAAGATTGACAGCAGACTGGAGCAG1260               TATACCAGTGCAATTGAGGGAACAAAAAGCGCAAAACCTACAAAGCCGGCAGCCTCGGAT1320               CTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATGTGGGAGAAAGGGAATGTG1380               TTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACTGCTGGCTTGAAGGTAGGG1440               GTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGATGGAAACAAGTCACCTGCT1500               CCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCAGCAAGCGGAACCTCTGGGAAAAG1560               CAATCTGTGGATAAGGTCACTTCCCCCACTAAGGTTTGA1599                                    (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1677 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       ATGCTGGGTGGATCCGGATCGCATGGAAGACGCAGCCTGGCCGCGCTCTCCCAAATCGCC60                 TACCAGAGGAATGACGATGATGAAGAGGAGGCAGCCCGGGAACGGCGCCGCCGAGCCCGA120                CAGGAACGGCTGCGGCAGAAGCAGGAGGAAGAATCCTTGGGACAGGTGACCGACCAGGTG180                GAGGTGAATGCCCAGAACAGTGTGCCTGACGAGGAGGCCAAGACAACCACCACAAACACT240                CAAGTGGAAGGGGATGATGAGGCCGCATTCCTGGAGCGCCTGGCTCGGCGTGAGGAAAGA300                CGCCAAAAACGCCTTCAGGAGGCTCTGGAGCGGCAGAAGGAGTTCGACCCAACAATAACA360                GATGCAAGTCTGTCGCTCCCAAGCAGAAGAATGCAAAATGACACAGCAGAAAATGAAACT420                ACCGAGAAGGAAGAAAAAAGTGAAAGTCGCCAAGAAAGATACGAGATAGAGGAAACAGAA480                ACAGTCACCAAGTCCTACCAGAAGAATGATTGGAGGGATGCTGAAGAAAACAAGAAAGAA540                GACAAGGAAAAGGAGGAGGAGGAAGAGGAGAAGCCAAAGCGAGGGAGCATTGGAGAAAAT600                CAGGGAGAAGAGAAGGGAACTAAAGTGCAAGCTAAAAGAGAAAAGCTCCAAGAAGACAAG660                CCTACCTTCAAAAAAGAAGAGATCAAAGATGAAAAGATTAAAAAGGACAAAGAACCCAAA720                GAAGAAGTTAAGAGCTTCATGGATCGAAAGAAGGGATTTACAGAAGTTAAGTCGCAGAAT780                GGAGAATTCATGACCCACAAACTTAAACATACTGAGAATACTTTCAGCCGCCCTGGAGGG840                AGGGCCAGCGTGGACACCAAGGAGGCTGAGGGCGCCCCCCAGGTGGAAGCCGGCAAAAGG900                CTGGAGGAGCTTCGTCGTCGTCGCGGGGAGACCGAGAGCGAAGAGTTCGAGAAGCTCAAA960                CAGAAGCAGCAGGAGGCGGCTTTGGAGCTGGAGGAACTCAAGAAAAAGAGGGAGGAGAGA1020               AGGAAGGTCCTGGAGGAGGAAGAGCAGAGGAGGAAGCAGGAGGAAGCCGATCGAAAACTC1080               AGAGAGGAGGAAGAGAAGAGGAGGCTAAAGGAAGAGATTGAAAGGCGAAGAGCAGAAGCT1140               GCTGAGAAACGCCAGAAGATGCCAGAAGATGGCTTGTCAGATGACAAGAAACCATTCAAG1200               TGTTTCACTCCTAAAGGTTCATCTCTCAAGATAGAAGAGCGAGCAGAATTTTTGAATAAG1260               TCTGTGCAGAAAAGCAGTGGTGTCAAATCGACCCATCAAGCAGCAATAGTCTCCAAGATT1320               GACAGCAGACTGGAGCAGTATACCAGTGCAATTGAGGGAACAAAAAGCGCAAAACCTACA1380               AAGCCGGCAGCCTCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATG1440               TGGGAGAAAGGGAATGTGTTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACT1500               GCTGGCTTGAAGGTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGAT1560               GGAAACAAGTCACCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCAGCAAG1620               CGGAACCTCTGGGAAAAGCAATCTGTGGATAAGGTCACTTCCCCCACTAAGGTTTGA1677                  (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2137 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION: 12-1607                                                          (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION: /note="human caldesmon of lower                         molecular weight"                                                              (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION: 2025-2030                                                        (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION: /note="poly A signal"                                   (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION: 2121-2137                                                        (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION: /note="poly A site"                                     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       CAACTGCGGACATGCTGGGTGGATCCGGATCGCATGGAAGACGCAGCCTGGCCGCGCTCT60                 CCCAAATCGCCTACCAGAGGAATGACGATGATGAAGAGGAGGCAGCCCGGGAACGGCGCC120                GCCGAGCCCGACAGGAACGGCTGCGGCAGAAGCAGGAGGAAGAATCCTTGGGACAGGTGA180                CCGACCAGGTGGAGGTGAATGCCCAGAACAGTGTGCCTGACGAGGAGGCCAAGACAACCA240                CCACAAACACTCAAGTGGAAGGGGATGATGAGGCCGCATTCCTGGAGCGCCTGGCTCGGC300                GTGAGGAAAGACGCCAAAAACGCCTTCAGGAGGCTCTGGAGCGGCAGAAGGAGTTCGACC360                CAACAATAACAGATGCAAGTCTGTCGCTCCCAAGCAGAAGAATGCAAAATGACACAGCAG420                AAAATGAAACTACCGAGAAGGAAGAAAAAAGTGAAAGTCGCCAAGAAAGATACGAGATAG480                AGGAAACAGAAACAGTCACCAAGTCCTACCAGAAGAATGATTGGAGGGATGCTGAAGAAA540                ACAAGAAAGAAGACAAGGAAAAGGAGGAGGAGGAAGAGGAGAAGCCAAAGCGAGGGAGCA600                TTGGAGAAAATCAGATCAAAGATGAAAAGATTAAAAAGGACAAAGAACCCAAAGAAGAAG660                TTAAGAGCTTCATGGATCGAAAGAAGGGATTTACAGAAGTTAAGTCGCAGAATGGAGAAT720                TCATGACCCACAAACTTAAACATACTGAGAATACTTTCAGCCGCCCTGGAGGGAGGGCCA780                GCGTGGACACCAAGGAGGCTGAGGGCGCCCCCCAGGTGGAAGCCGGCAAAAGGCTGGAGG840                AGCTTCGTCGTCGTCGCGGGGAGACCGAGAGCGAAGAGTTCGAGAAGCTCAAACAGAAGC900                AGCAGGAGGCGGCTTTGGAGCTGGAGGAACTCAAGAAAAAGAGGGAGGAGAGAAGGAAGG960                TCCTGGAGGAGGAAGAGCAGAGGAGGAAGCAGGAGGAAGCCGATCGAAAACTCAGAGAGG1020               AGGAAGAGAAGAGGAGGCTAAAGGAAGAGATTGAAAGGCGAAGAGCAGAAGCTGCTGAGA1080               AACGCCAGAAGATGCCAGAAGATGGCTTGTCAGATGACAAGAAACCATTCAAGTGTTTCA1140               CTCCTAAAGGTTCATCTCTCAAGATAGAAGAGCGAGCAGAATTTTTGAATAAGTCTGTGC1200               AGAAAAGCAGTGGTGTCAAATCGACCCATCAAGCAGCAATAGTCTCCAAGATTGACAGCA1260               GACTGGAGCAGTATACCAGTGCAATTGAGGGAACAAAAAGCGCAAAACCTACAAAGCCGG1320               CAGCCTCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATGTGGGAGA1380               AAGGGAATGTGTTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACTGCTGGCT1440               TGAAGGTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGATGGAAACA1500               AGTCACCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCAGCAAGCGGAACC1560               TCTGGGAAAAGCAATCTGTGGATAAGGTCACTTCCCCCACTAAGGTTTGAGACAGTTCCA1620               GAAAGAACCCAAGCTCAAGACGCAGGACGAGCTCAGTTGTAGAGGGCTAATTCGCTCTTT1680               TGTATTTATGTTGATTTACTAAATTGGGTTCATTATCTTTTATTTTTCAATATCCCAGTA1740               AACCCATGTATATTATCACTATATTTAATAATCACAGCTAGAGATGTTCATGGTAAAAGT1800               ACTGCCTTTGCACAGGAGCCTGTTTCTAAAGAAACCCATGCTGTGAAATAGAGACTTTTC1860               TACTGATCATCATAACTCTGTATCTGAGCAGTGATACCAACCACATCTGAAGTCAACAGA1920               AGATCCAAGTTTAAAATTGCCTGCGGAATGTGTGCAGTATCTAGAAAAATGAACCGTAGT1980               TTTGTTTTTTTAAATACAGAAGTCATGTTGTTTCTGCACTTTATAATAAAGCATGGAAGA2040               AATTATCTTAGTAGGCAATTGTAACACTTTTTGAAAGTAACCCATTTCAGATTTGAAATA2100               CTGCAATAATGGTTGTCTTTAAAAAAAAAAAAAAAAA2137                                      (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2215 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION: 12-1685                                                          (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION: /note="human caldesmon of higher                        molecular weight"                                                              (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION: 2103-2108                                                        (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION: /note="poly A signal"                                   (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION: 2099-2215                                                        (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION: /note="poly A site"                                     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       CAACTGCGGACATGCTGGGTGGATCCGGATCGCATGGAAGACGCAGCCTGGCCGCGCTCT60                 CCCAAATCGCCTACCAGAGGAATGACGATGATGAAGAGGAGGCAGCCCGGGAACGGCGCC120                GCCGAGCCCGACAGGAACGGCTGCGGCAGAAGCAGGAGGAAGAATCCTTGGGACAGGTGA180                CCGACCAGGTGGAGGTGAATGCCCAGAACAGTGTGCCTGACGAGGAGGCCAAGACAACCA240                CCACAAACACTCAAGTGGAAGGGGATGATGAGGCCGCATTCCTGGAGCGCCTGGCTCGGC300                GTGAGGAAAGACGCCAAAAACGCCTTCAGGAGGCTCTGGAGCGGCAGAAGGAGTTCGACC360                CAACAATAACAGATGCAAGTCTGTCGCTCCCAAGCAGAAGAATGCAAAATGACACAGCAG420                AAAATGAAACTACCGAGAAGGAAGAAAAAAGTGAAAGTCGCCAAGAAAGATACGAGATAG480                AGGAAACAGAAACAGTCACCAAGTCCTACCAGAAGAATGATTGGAGGGATGCTGAAGAAA540                ACAAGAAAGAAGACAAGGAAAAGGAGGAGGAGGAAGAGGAGAAGCCAAAGCGAGGGAGCA600                TTGGAGAAAATCAGGGAGAAGAGAAGGGAACTAAAGTGCAAGCTAAAAGAGAAAAGCTCC660                AAGAAGACAAGCCTACCTTCAAAAAAGAAGAGATCAAAGATGAAAAGATTAAAAAGGACA720                AAGAACCCAAAGAAGAAGTTAAGAGCTTCATGGATCGAAAGAAGGGATTTACAGAAGTTA780                AGTCGCAGAATGGAGAATTCATGACCCACAAACTTAAACATACTGAGAATACTTTCAGCC840                GCCCTGGAGGGAGGGCCAGCGTGGACACCAAGGAGGCTGAGGGCGCCCCCCAGGTGGAAG900                CCGGCAAAAGGCTGGAGGAGCTTCGTCGTCGTCGCGGGGAGACCGAGAGCGAAGAGTTCG960                AGAAGCTCAAACAGAAGCAGCAGGAGGCGGCTTTGGAGCTGGAGGAACTCAAGAAAAAGA1020               GGGAGGAGAGAAGGAAGGTCCTGGAGGAGGAAGAGCAGAGGAGGAAGCAGGAGGAAGCCG1080               ATCGAAAACTCAGAGAGGAGGAAGAGAAGAGGAGGCTAAAGGAAGAGATTGAAAGGCGAA1140               GAGCAGAAGCTGCTGAGAAACGCCAGAAGATGCCAGAAGATGGCTTGTCAGATGACAAGA1200               AACCATTCAAGTGTTTCACTCCTAAAGGTTCATCTCTCAAGATAGAAGAGCGAGCAGAAT1260               TTTTGAATAAGTCTGTGCAGAAAAGCAGTGGTGTCAAATCGACCCATCAAGCAGCAATAG1320               TCTCCAAGATTGACAGCAGACTGGAGCAGTATACCAGTGCAATTGAGGGAACAAAAAGCG1380               CAAAACCTACAAAGCCGGCAGCCTCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAACA1440               TCAAGAGTATGTGGGAGAAAGGGAATGTGTTTTCATCCCCCACTGCAGCAGGCACACCAA1500               ATAAGGAAACTGCTGGCTTGAAGGTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACTA1560               AAACCCCAGATGGAAACAAGTCACCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGACG1620               TATCCAGCAAGCGGAACCTCTGGGAAAAGCAATCTGTGGATAAGGTCACTTCCCCCACTA1680               AGGTTTGAGACAGTTCCAGAAAGAACCCAAGCTCAAGACGCAGGACGAGCTCAGTTGTAG1740               AGGGCTAATTCGCTCTTTTGTATTTATGTTGATTTACTAAATTGGGTTCATTATCTTTTA1800               TTTTTCAATATCCCAGTAAACCCATGTATATTATCACTATATTTAATAATCACAGCTAGA1860               GATGTTCATGGTAAAAGTACTGCCTTTGCACAGGAGCCTGTTTCTAAAGAAACCCATGCT1920               GTGAAATAGAGACTTTTCTACTGATCATCATAACTCTGTATCTGAGCAGTGATACCAACC1980               ACATCTGAAGTCAACAGAAGATCCAAGTTTAAAATTGCCTGCGGAATGTGTGCAGTATCT2040               AGAAAAATGAACCGTAGTTTTGTTTTTTTAAATACAGAAGTCATGTTGTTTCTGCACTTT2100               ATAATAAAGCATGGAAGAAATTATCTTAGTAGGCAATTGTAACACTTTTTGAAAGTAACC2160               CATTTCAGATTTGAAATACTGCAATAATGGTTGTCTTTAAAAAAAAAAAAAAAAA2215                    (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 29 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: No                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       CAGACCATGGATCGAAAGAAGGGATTTAC29                                                (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: Yes                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       TACCGAGCTCAAACCTTAGTGGGGGAAGTG30                                               (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 47 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: No                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       CAGACCATGGCCATCGAGGGTAGGAGACTGGAGCAGTATACCAGTGC47                              (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: Yes                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       GCTCGAATTCAAACCTTAGTGGGGGAAGTG30                                               (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: No                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       CAGACCATGGCCAGACTGGAGCAGTATACC30                                               (2) INFORMATION FOR SEQ ID NO:20:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 42 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: No                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                       CAGACCATGGCCATCGAGGGTAGGATTGAGGGAACAAAAAGC42                                   (2) INFORMATION FOR SEQ ID NO:21:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 26 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: Yes                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                       GCTCGAATTCAGGATACGTCTCCTGG26                                                   (2) INFORMATION FOR SEQ ID NO:22:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 94 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                       MetAlaIleGluGlyArgIleGluGlyThrLysSerAlaLysPro                                  151015                                                                         ThrLysProAlaAlaSerAspLeuProValProAlaGluGlyVal                                  202530                                                                         ArgAsnIleLysSerMetTrpGluLysGlyAsnValPheSerSer                                  354045                                                                         ProThrAlaAlaGlyThrProAsnLysGluThrAlaGlyLeuLys                                  505560                                                                         ValGlyValSerSerArgIleAsnGluTrpLeuThrLysThrPro                                  657075                                                                         AspGlyAsnLysSerProAlaProLysProSerAspLeuArgPro                                  808590                                                                         GlyAspValSer                                                                   (2) INFORMATION FOR SEQ ID NO:23:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 285 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                       ATGGCCATCGAGGGTAGGATTGAGGGAACAAAAAGCGCAAAACCTACAAAGCCGGCAGCC60                 TCGGATCTTCCTGTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATGTGGGAGAAAGGG120                AATGTGTTTTCATCCCCCACTGCAGCAGGCACACCAAATAAGGAAACTGCTGGCTTGAAG180                GTAGGGGTTTCTAGCCGTATCAATGAATGGCTAACTAAAACCCCAGATGGAAACAAGTCA240                CCTGCTCCCAAACCTTCTGACTTGAGACCAGGAGACGTATCCTGA285                               (2) INFORMATION FOR SEQ ID NO:24:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: No                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                       CAGACCATGGCCATTGAGGGAACAAAAAGC30                                               (2) INFORMATION FOR SEQ ID NO:25:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 90 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                       MetAlaIleGluGlyThrLysSerAlaLysProThrLysProAla                                  151015                                                                         AlaSerAspLeuProValProAlaGluGlyValArgAsnIleLys                                  202530                                                                         SerMetTrpGluLysGlyAsnValPheSerSerProThrAlaAla                                  354045                                                                         GlyThrProAsnLysGluThrAlaGlyLeuLysValGlyValSer                                  505560                                                                         SerArgIleAsnGluTrpLeuThrLysThrProAspGlyAsnLys                                  657075                                                                         SerProAlaProLysProSerAspLeuArgProGlyAspValSer                                  808590                                                                         (2) INFORMATION FOR SEQ ID NO:26:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 273 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE: Human                                                    (vii) IMMEDIATE SOURCE: Hela Cell                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                       ATGGCCATTGAGGGAACAAAAAGCGCAAAACCTACAAAGCCGGCAGCCTCGGATCTTCCT60                 GTTCCTGCTGAAGGTGTACGCAACATCAAGAGTATGTGGGAGAAAGGGAATGTGTTTTCA120                TCCCCCACTGCAGCAGGCACACCAAATAAGGAAACTGCTGGCTTGAAGGTAGGGGTTTCT180                AGCCGTATCAATGAATGGCTAACTAAAACCCCAGATGGAAACAAGTCACCTGCTCCCAAA240                CCTTCTGACTTGAGACCAGGAGACGTATCCTGA273                                           (2) INFORMATION FOR SEQ ID NO:27:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 88 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                       IleGluGlyThrLysSerAlaLysProThrLysProAlaAlaSer                                  151015                                                                         AspLeuProValProAlaGluGlyValArgAsnIleLysSerMet                                  202530                                                                         TrpGluLysGlyAsnValPheSerSerProThrAlaAlaGlyThr                                  354045                                                                         ProAsnLysGluThrAlaGlyLeuLysValGlyValSerSerArg                                  505560                                                                         IleAsnGluTrpLeuThrLysThrProAspGlyAsnLysSerPro                                  657075                                                                         AlaProLysProSerAspLeuArgProGlyAspValSer                                        8085                                                                           (2) INFORMATION FOR SEQ ID NO:28:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 6 amino acids                                                      (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                       MetAlaIleGluGlyArg                                                             15                                                                             (2) INFORMATION FOR SEQ ID NO:29:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: Yes                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                       CTACAACTGAGCTCGTCCTG20                                                         (2) INFORMATION FOR SEQ ID NO:30:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 22 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: No                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                       ACATGCTGGGTGGATCCGGATC22                                                       (2) INFORMATION FOR SEQ ID NO:31:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: Yes                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                       GGAACTGTCTCAAACCTTAG20                                                         (2) INFORMATION FOR SEQ ID NO:32:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iv) ANTI-SENSE: Yes                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                       GTTTAAGTTTGTGGGTCATGAATTCTCC28                                                 __________________________________________________________________________ 

What we claim is:
 1. An isolated DNA encoding an actin- or calmodulin-binding polypeptide fragment of a human caldesmon protein, said DNA comprising a nucleotide sequence encoding at least the amino acid sequence shown in SEQ ID NO:1, wherein said caldesmon protein has the amino acid sequence shown in SEQ ID NO:5 or SEQ ID NO:6.
 2. The DNA according to claim 1, comprising a nucleotide sequence encoding the amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4.
 3. The DNA according to claim 1, which comprises the nucleotide sequence shown in SEQ ID NO:7.
 4. The DNA according to claim 1, which comprises a nucleotide sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9 and SEQ ID NO:10.
 5. An isolated DNA which is complementary to the DNA according to any one of claims 1, 2, 3, or
 4. 6. An isolated DNA which hybridizes under stringent conditions with the complement of the DNA according to any one of claims 1, 2, 3 or 4, said DNA coding for a human polypeptide that has calmodulin-binding activity and actin-binding activity.
 7. An isolated DNA encoding a chimeric protein, comprising the DNA according to any one of claims 1, 2, 3, or
 4. 8. A cDNA which codes for a human caldesmon protein having the amino acid sequence shown in SEQ ID NO:5 or SEQ ID NO:6.
 9. The cDNA according to claim 8, which has a nucleotide sequence shown in SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13 or SEQ ID NO:14.
 10. A recombinant vector comprising the DNA according to any one of claims 1, 2, 3, 4 or 7 or the cDNA according to any one of claims 8 or
 9. 11. A host cell transformed with the recombinant vector according to claim
 10. 12. A process for producing a human caldesmon protein or active fragment thereof, which comprises:culturing the host cell according to claim 11 under conditions to express the human caldesmon protein or active fragment thereof, and isolating the caldesmon protein or active fragment. 